Abstract

Background: Sialyl-Lewis X (SLe x ) is a well-known glycan structure involved in leukocyte homing and
recruitment to inflammatory sites. SLe x is well conserved among species and is mainly synthesized by FucT-VII in
vertebrates. The enzyme responsible for its biosynthesis in cattle was not known.

Results: We cloned a cDNA sequence encoding bovine α3-fucosyltransferase VII that shares 83% identity with its
human counterpart. Located at the BTA11 telomeric region, the 1029 bp open reading frame is spread over two
exons: E1, which also contains the unique 5'-untranslated region, and E2, which includes the entire
3'-untranslated region. The bfut7 expression pattern is restricted to thymus and spleen. A single transcript leading to
the synthesis of a 342 aa protein was identified. The encoded fucosyltransferase, produced as a recombinant
enzyme in COS-1 cells, was shown to be specifically responsible for SLe x synthesis in cattle. In addition, we showed
that the gene promoter evolved from fish to mammals towards a complex system related to the immune system.
Beyond the fact that the gene regulation seems to be conserved among mammals, we also identified 7 SNPs,
including 3 missense mutations, in the coding region in a small panel of animals.

Conclusions: The FUT7 sequence was highly conserved as well as the specific activity of the encoded protein 
FucT-VII. In addition, our in silico promoter analysis and the high rate of polymorphism suggested that its function is 
evolving toward a complex system related to the immune system. Furthermore, comparing bovine to human and 
mouse sequences, it appeared that a decrease in gene regulation was correlated with an increase in mutation rate 
and wider tissue expression. 
Background

Located at the non-reducing end of the glycans of
glycoproteins (N- and O-linked) and glycolipids,
α3-fucosylated oligosaccharides form the Lewis-related
carbohydrate antigen family (i.e. Le x , Le y and Sialyl-Le x ),
one of the major components of cell surface
oligosaccharides. One of them, Sialyl-Le x , is involved, owing to its
external position, in numerous cell-cell interactions
occurring during embryo implantation [1], malignant
transformation [2] and inflammation [3].

For example, leukocyte homing or recruitment to
inflammatory and tissue injury sites involves leukocyte
rolling along the endothelial wall [4,5], binding to
endothelial cells near the inflammation site via integrin
adhesion [6] and then migration into the underlying tissue
[7]. The initial rolling step is mediated by the interaction
of leukocyte glycoproteins containing active moieties
such as Sialyl-Lewis x with selectins expressed on
endothelial cells [4]. SLe x expression is constitutive on
granulocytes and monocytes but not on lymphocytes [8].
Indeed, SLe x expression on lymphocytes results from
the transcriptional activation of genes coding for the
fucosyltransferases mainly involved in the last step of its
biosynthesis.

Fucosylation is catalyzed by α3-fucosyltransferases
belonging to CAZY family 10
(http://www.cazy.org/fam/GT10.html), especially by FucT-IV
and mainly FucT-VII [9,10]. Both enzymes are Golgi-resident
type II glycoproteins [11,12] that catalyze the transfer of fucose in
an α3-linkage from GDP-Fuc to the N-acetylglucosamine of
N-acetyllactosamine (Galβ1,4GlcNAc). These linkage
and substrate specificities are due to the presence of 5
conserved motifs in the catalytic domain that are very
important for substrate binding and the catalytic
mechanism. Largely described in humans and mice [13], the
α3-fucosyltransferase coding sequences are present in
only one exon, whereas other genes of the family
generally include untranslated exons. Furthermore, the
encoded proteins are well conserved across species. We
can distinguish two types of α3-fucosyltransferases,
those with a low polymorphism rate such as FucT-IX
and the others with a higher rate like FucT-III and
FucT-V, associated with an elevated evolution rate [14].

The aim of this paper was to assess the variability of
FUT7 in mammals. In spite of its involvement in
fundamental processes, known mutations in humans and mice
are associated with a loss of FucT-VII enzyme activity
but do not lead to critical phenotypes [15]. We first
wanted to compare gene structure and expression
pattern among vertebrates and to study
the intraspecific variation of the sequence. These features
are not independent, as genes that evolve faster have a
large number of transcripts whereas more conserved
genes seem to be associated with the presence of only
one transcript [16]. In contrast to other
α3-fucosyltransferase genes (FUT3, 4, 5, 6 and 9), where the coding
sequence is monoexonic, the corresponding regions of
human and mouse FUT7 are split into two exons.

FUT7 is well studied in humans and mice. Our study
focused on the bovine gene since the
α3-fucosyltransferase genes remain poorly documented in
cattle with the exception of futb [17]. The NCBI Map Viewer
website displayed a potentially encoded protein
(DAA24060) 33 amino acids shorter (309 amino acids
instead of 342) than that of other mammals.
Surprisingly, the missing region was located at the end of the
catalytic domain.

Here we propose a complete description of the bo- 
vine gene including its structure, transcripts and ex- 
pression pattern. In addition, we characterized this 
sequence using phylogeny and synteny approaches as 
well as the function of its product by enzyme activity 
tests. 

Results 

Molecular cloning of potential bovine FUT7 cDNA 

Using a partial bovine FUT7 gene model (generated by
the Bovine Genome Sequencing Project) as a template
for PCR amplification, we characterized the complete
sequence of the bovine FUT7 gene. The coding
region of the cloned bfut7 is 1029 bp long and encodes a
342 amino acid type II glycoprotein that shares
79% and 74% identity with its human and
mouse counterparts, respectively. This protein contains the 5 specific
motifs of α3/4-fucosyltransferases (Figure 1) involved
in substrate binding (motifs I to III) and enzyme
mechanism (motifs IV and V). Motif II possesses the
HHRE sequence specific for α1,3-fucosylation activity.
All motifs were strictly conserved compared to other
vertebrates. Moreover, we identified a new motif (motif
VI) downstream from motif V (Figure 1A) which is
common to the vertebrate α3/4-fucosyltransferase family
(Figure 1B).

Figure 1 Structure of bovine FucT-VII. A, Bovine fucosyltransferase VII. TMD represents the transmembrane domain. MI to MVI represent the
α1-3/4 fucosyltransferase specific motifs. The logos show amino acid conservation between α1-3/4 fucosyltransferases. B, amino acid conservation
in α1-3/4 fucosyltransferase motif VI between species.
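The relation between the 1029 bp coding region and the 342 amino acid protein can be checked with a short calculation. The following is a minimal sketch, assuming that the reported coding length includes the stop codon; the function name is hypothetical and chosen for illustration only.

```python
def protein_length_from_orf(orf_bp: int) -> int:
    """Number of amino acids encoded by an ORF whose length includes the stop codon."""
    if orf_bp <= 0 or orf_bp % 3 != 0:
        raise ValueError("ORF length must be a positive multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return orf_bp // 3 - 1


if __name__ == "__main__":
    # 1029 bp is the bfut7 coding length reported in the text.
    print(protein_length_from_orf(1029))
```

Under that assumption, the 1029 bp coding region corresponds to 343 codons, the last of which is the stop codon.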

Characterization of bovine fut7 and its product 

To assess the nature of the new bovine FUT, three
independent analyses were undertaken. A molecular phylogeny
involving 14 amino acid sequences and 276 sites was
inferred by Maximum Likelihood. All
positions containing gaps and missing data were
eliminated, leaving a total of 263 positions in the final
dataset. The bovine sequence appeared to belong to
the FUT7 group with high bootstrap values (98%
with human FUT7 and 99% with tetrapod FUT7)
(Figure 2A).

To verify the bfut7 position in the bovine genome, we
screened a bovine whole-genome radiation hybrid panel
[18] by PCR. Amplification of a 278 bp genomic DNA
fragment mapped bfut7 to the telomeric region of
BTA11 between RSJW577 and BMS1350, two
microsatellite markers positioned at bases
106,193,598 and 106,646,683, respectively. The ABCA2, C9orf172 and
PTGDS genes are also found in this interval. Using the
Ensembl database, a synteny analysis within mammals
gave a block of 6 genes surrounding FUT7, common to
cows, dogs, mice and humans (Figure 2B). Thus, the
bovine gene has the same gene environment as other
known mammalian FUT7 genes.

To check whether the bfut7 cDNA produced an
active enzyme, the open reading frame was inserted into a
mammalian vector and transiently transfected into
COS-1 cells. The ability of the recombinant enzyme to
transfer fucose was evaluated using Type I-octyl and
Type II-octyl oligosaccharides, each in sialylated and
non-sialylated form, and an H-Type II-octyl
oligosaccharide. FucT-VII was specifically responsible for SLe x
biosynthesis in cattle (Table 1), since highly significant
activity was found mainly with the sialylated Type II
substrate. In contrast, like the other tested enzymes,
recombinant FucT-VII did not transfer fucose to Type I
substrates.
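The acceptor preference described above can be made explicit by expressing each activity as a percentage of the highest one. The sketch below uses only the bFucT-VII values given in Table 1; the short acceptor labels and the function name are assumptions introduced for illustration.

```python
# bFucT-VII activities from Table 1 (pmol fucose per hour per mg protein).
# The acceptor labels are shorthand chosen for this sketch.
BFUCT7_ACTIVITY = {
    "type I": 0,
    "sialyl type I": 0,
    "type II": 40,
    "H type II": 0,
    "sialyl type II": 1108,
}


def relative_activity(activities):
    """Express each activity as a percentage of the highest activity."""
    top = max(activities.values())
    if top <= 0:
        raise ValueError("at least one activity must be positive")
    return {name: round(100 * value / top, 1) for name, value in activities.items()}


if __name__ == "__main__":
    for acceptor, percent in relative_activity(BFUCT7_ACTIVITY).items():
        print(f"{acceptor}: {percent}%")
```

On these figures, the non-sialylated Type II acceptor reaches only a few percent of the activity measured with the sialylated Type II acceptor.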
Figure 2 Position of bovine FUT7 by two independent methods. 2A. Molecular phylogenetic analysis by the Maximum Likelihood method based
on the matrix-based model [1]. The tree with the highest log likelihood (-5230.5756) is shown. The percentage of trees in which the
associated taxa clustered together is shown next to the branches. A discrete Gamma distribution was used to model evolutionary rate differences
among sites (5 categories (+G, parameter = 1.4694)). The rate variation model allowed some sites to be evolutionarily invariable ([+I], 14.8207%
sites). The tree is drawn to scale, with branch lengths measured in the number of substitutions per site. 2B. Synteny shared by Canis familiaris, Bos
taurus, Homo sapiens, and Mus musculus around FUT7. The positions of the genes (in bp) ending each chromosome segment are indicated above
and below. The double arrows correspond to microsatellite markers experimentally determined in BTA11.
Table 1 Activity of bovine α1-3 fucosyltransferases

Acceptors                    bFucT-VII   bFucT-IV*   bFucT-IX*
Galβ1-3GlcNAc-octyl                  0           0           0
NeuAcα2,3Galβ1,3GlcNAc               0           0           0
Galβ1-4GlcNAc-octyl                 40         885         904
Fucα1,2Galβ1,4GlcNAc                 0                    1068
NeuAcα2,3Galβ1,4GlcNAc            1108         201          43

*C. Javaud, PhD (2002).

Recombinant FucT-VII, FucT-IV* and FucT-IX were produced in COS-1 cells.
Arylglycosides were used at a final concentration of 1 mM. Results are
expressed as pmol of fucose transferred per hour per mg of protein.



Organization of the FUT7 gene 

To determine the gene structure, a 1135 bp fragment was
amplified by PCR from genomic DNA. After sequencing,
comparison with the sequence obtained from thymus
cDNA showed that the ORF comprised 2 exons,
E1 and E2. Exon E1 contained the first 13 bp of the
coding sequence and E2 included the last 1016 bp of the
coding region. To complete the information concerning
the 3'UTR sequence, we took into account the WGS
contig AAFC01684977. Taking these data together, we
established that the bfut7 gene was composed of two exons,
E1 (369 bp) and E2 (1503 bp), the latter including the 507 bp
3'UTR with the consensus sequence AATAAA 481 bp
downstream from TGA (Figure 3A). E1 was separated
from exon E2 by a 92 bp intronic sequence.
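The exon figures above can be cross-checked with simple arithmetic. The following sketch uses only lengths quoted in the text; the variable names are hypothetical. The last quantity is reported rather than asserted, because the E2 lengths quoted here (1503 bp total, 1016 bp coding, 507 bp 3'UTR) do not sum exactly.

```python
# Lengths in bp, as quoted in the text.
E1_TOTAL = 369
E1_CODING = 13
INTRON = 92
E2_TOTAL = 1503
E2_CODING = 1016
UTR3 = 507

coding_length = E1_CODING + E2_CODING            # full coding sequence
utr5_length = E1_TOTAL - E1_CODING               # untranslated part of E1
genomic_span = E1_TOTAL + INTRON + E2_TOTAL      # E1 + intron + E2
e2_difference = E2_TOTAL - (E2_CODING + UTR3)    # nonzero means the E2 figures disagree

if __name__ == "__main__":
    print("coding length:", coding_length)
    print("5'UTR length:", utr5_length)
    print("genomic span:", genomic_span)
    print("E2 total minus (coding + 3'UTR):", e2_difference)
```

The coding length and the 5'UTR length obtained this way agree with the 1029 bp coding region and the 356 base 5'-UTR reported elsewhere in the paper.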

When comparing FUT7 gene organization, it appears
that in the gray short-tailed opossum (Monodelphis
domestica), chicken (Gallus gallus) and western clawed frog
(Xenopus tropicalis), the whole coding sequence is
included in only one exon that corresponds to the gene
sequence. A similar organization is observed in zebrafish
(Danio rerio) with the exception of a short upstream
untranslated exon (Figure 3D). In Eutherians, the coding
sequence is split into two exons. In mammals, the
intron emerged at the base of Eutherians
(Figure 4), since we identified an intronic sequence,
similar to those observed in human and cattle
(Figure 3A and B), in Loxodonta africana (data not
shown) whereas no introns were retrieved in M.
domestica. The main exon always has the same coding size
(1016 bp). It should be noted that the ancestor of rats
and mice gained another intron upstream from the
previous one (Figure 3C).

bfut7 expression in different bovine tissues 

In order to determine the bfut7 expression pattern, we
performed RT-PCR using forward and reverse primers
located in two different exons (exons 1 and 2,
respectively) to avoid amplification from contaminating
genomic DNA. Among the
eight tested tissues, we detected bfut7 transcripts in
thymus, spleen, liver and lung (Figure 5). Expression was
highest in the thymus and weakest in the lung.
Conversely, we did not detect any bfut7
transcripts in brain, mammary gland, colon and kidney.
Moreover, 5'-RACE analyses performed on tissues expressing
the gene revealed a single transcript with a 356 base
5'-UTR.

In connection with tissue expression, in silico searches
for TFBS (transcription factor binding sites) were
undertaken, as the expression pattern of the bovine gene
seemed to be more widespread than that of its human and
mouse counterparts. During vertebrate evolution,
it seems that two TFBS, i.e., ETSF and SMAD, have
been conserved among the studied species (Figure 4).
EBOX could also be considered as ancestral in
vertebrates, but a greater number of sequences should be
added to test this alternative model. The number of
transcription factors able to bind in the 1000 bp
upstream from the first exon increases in the course of
evolution. Indeed, in most Eutherians, FUT7 expression
seems to be regulated by at least nine transcription
factors, five of them being common to humans, mice,
horses and cattle. However, the bovine ancestor has
probably lost two of them (EBOX and GATA). In other
vertebrates, fewer transcription factor binding
sites were found, varying from 2 in Gallus
gallus to 4 in Danio rerio.

Figure 3 FUT7 gene organization. Structure of bovine (A), human (B), mouse (C) and zebrafish (D) genes. Grey bars represent the open
reading frame while white bars indicate the gene untranslated regions. The sizes of exons and introns (italic) are indicated in base pairs. Dotted
lines indicate a possible splicing event during transcription. Black circles show gene promoters.

Figure 4 Variation of transcription factor binding sites in FUT7 promoter regions in vertebrates. Transcription factors able to bind to the
FUT7 promoter region are placed on the branches of the phylogenetic tree (horizontal bars) according to their appearance. Circles indicate the
loss of DNA sites to which the transcription factors can bind. Triangles point out the split of FUT7 due to the insertion of intronic sequences into
the coding region.

Figure 5 bfut7 gene expression in different bovine tissues. The
gene expression pattern was determined by RT-PCR using mRNA
from 8 different bovine tissues. Forward primer (fut7f) was chosen in
exon E1 and reverse primer (fut7r) in exon E2 in order to amplify a
186 bp specific cDNA fragment. A GAPDH fragment of 90 bp was
amplified as a control.

Intraspecific variation of bfut7 

Since mutations leading to the loss of enzyme activity
and to inflammatory diseases have been described in
humans and mice, we investigated gene variation by
sequencing genomic DNA from 30 animals belonging to
six different cattle breeds (Blonde d'Aquitaine,
Charolaise, Holstein, Limousine, Montbeliarde and
Normande). We identified seven SNPs in the coding region
of bfut7 at bases 58, 95, 213, 335, 501, 792 and 831
(Table 2). Among these mutations, three were missense
(58, 95 and 335). At position 58, substitution of a
guanine by an adenine induces the translation of a
methionine instead of valine 20. At positions 95 and 335, the
G→A SNP changes R32 and R112, respectively, to Q. Surprisingly,
no polymorphisms were found in the five Holstein animals,
while in the other breeds we always identified mutations
in spite of the relatively low number of samples.
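The correspondence between base positions in the coding sequence and the affected residues follows from reading the sequence in triplets. The sketch below maps each missense position to its codon and illustrates the two substitutions with example codons. The codons GTG and CGG are assumptions chosen for illustration (any valine codon GTN, and the arginine codons CGA or CGG, would behave the same way); they are not taken from the bfut7 sequence.

```python
def codon_of(base_position):
    """Return (codon number, position within codon), both 1-based."""
    if base_position < 1:
        raise ValueError("base positions are 1-based")
    index = base_position - 1
    return index // 3 + 1, index % 3 + 1


# Partial codon table, limited to the codons used in this illustration.
CODON_TABLE = {"GTG": "V", "ATG": "M", "CGG": "R", "CAG": "Q"}


def substitute(codon, position_in_codon, new_base):
    """Return the codon obtained by replacing one base (1-based position)."""
    bases = list(codon)
    bases[position_in_codon - 1] = new_base
    return "".join(bases)


if __name__ == "__main__":
    # Hypothetical reference codons for the three missense SNPs.
    examples = {58: "GTG", 95: "CGG", 335: "CGG"}
    for position, reference in examples.items():
        number, offset = codon_of(position)
        mutant = substitute(reference, offset, "A")
        print(position, number, offset,
              CODON_TABLE[reference], "->", CODON_TABLE[mutant])
```

Position 58 falls on the first base of codon 20, and positions 95 and 335 on the second base of codons 32 and 112, which is consistent with the V→M and R→Q changes reported.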

Discussion 

Terminal fucosylation is involved in a large number of
processes such as cell migration and leukocyte
recruitment to inflammatory sites [10]. This process implies
the presence of a specific glycan structure, sialyl-Lewis X,
mainly synthesized by the Fuc-TVII enzyme. The gene
encoding this protein, fut7, is widespread among
vertebrate species [14]. To date, no active
glycosyltransferase synthesizing the sialyl-Lewis
X motif has been identified in cattle although the
structure is present [19], and surprisingly the reference
sequence in the NCBI data bank corresponds to an enzyme
truncated in its C-terminal catalytic domain. In this
study, we describe the bovine gene, bfut7, that encodes
an active enzyme responsible for synthesis of the
sialyl-Lewis X motif.

This gene spans two exons separated by a short in- 
tronic sequence (Figure 3) and is located in the telo- 
meric region of BTA11, a region homologous to HSA9, 
MMU2 and CFA9 (Figure 2) which contains the human, 
mouse and dog FUT7 gene [20], supporting UMD3.1 as- 
sembly [21]. Unexpectedly, this location is not related to 
a known QTL associated with resistance to diseases such 
as mastitis [22] whereas its product recognized mainly 
sialyl type II as a substrate and thus elaborated the 
sialyl-lewis X motif (Table 1) involved in leukocyte 
recruitment. 

Only one transcript containing both exons was found 
in tested tissues. The expression pattern was similar to 
that observed in other mammals for thymus and spleen 



Table 2 Polymorphism analysis of bfut7 in cattle breeds

Base position in the   Holstein*   Other genotypes   Position of amino acid
coding sequence        genotype    observed          and substitution
58                     G/G         A/G               20 aa V→M
95                     G/G         A/A, A/G          32 aa R→Q
213                    C/C         C/T, T/T          38 aa None
335                    G/G         A/G, A/A          112 aa R→Q
501                    G/G         G/C               167 aa None
792                    C/C         C/T               264 aa None
831                    C/C         C/T               277 aa None

[Per-breed genotype columns (Blonde d'Aquitaine, Charolaise, Limousine, Montbeliarde, Normande) and the number of animals carrying each polymorphic sequence: not recoverable.]

Data were obtained from 6 cattle breeds (Blonde d'Aquitaine, Charolaise, Limousine, Holstein, Normande and Montbeliarde). For each breed, 5 animals were
analysed. * The haplotype found for the Holstein breed is also that of the reference sequence (DQ339142.1).
[23], whereas its expression in lung and liver seems to be
specific to cattle. Indeed, the hypothesis of lung
expression due to the presence of leukocytes is unlikely
because the individuals were considered healthy by
animal control. Until now, FUT7 expression has
not been described in lung epithelial cells, and when a
sialyl-Lewis X motif was observed at the surface of these cells
in humans, it was correlated with FUT4 expression [24].
Liver expression was described in humans, but only
in particular cases such as liver regeneration [2]. In mice,
only one EST from liver is found in the Unigene
database.

We also observed a change in gene organization
associated with the evolution of the promoter region. The
monoexonic structure including the coding and the 5' and 3'
untranslated sequences, found from D. rerio to G. gallus,
was split in Eutherians (bovine and human) into at least
two exons, one of them bearing a short coding sequence.
In contrast, other α1,3/4 fucosyltransferase genes
described in vertebrates only have a monoexonic coding
sequence [25]. Exon 1 possesses the first 13 nucleotides
of the coding sequence in both species (Figure 3); these are also
found in mice and rats, but in exon 3. In
these species, another exon (E2) contains a 154 bp
sequence coding for the first Fuc-TVII amino acids, and
leads to an alternative longer form (52 amino acids) of the
protein [26]. This is the result of alternative splicing and
of the use of two promoter regions, which generate 4
transcripts, while only one is found in other vertebrate fut7
genes. A search for putative transcription factor binding
sites showed that in mice two regions contain such sites,
one upstream from E1 and corresponding to the long
form of the protein and another upstream from the 67 bp
E3 giving rise to the same protein as observed in humans
and cattle. Only one binding site for E-Twenty Six Factor
(ETSF) was found in all species studied from D. rerio to
H. sapiens. It has been described to participate in
differentiation, maintenance and function of the immune
system [27]. A greater number of transcription factors
seemed to regulate FUT7 transcription in M. musculus,
B. taurus and H. sapiens (Figure 4). This may be
explained by cis-clusters including ETSF, SP1F, STAT,
EBOX and CREB that occur in activated T-cells, and by
CREB, NFKB and GATA that are involved in stimulated
lymph nodes [27]. It has also been shown that CREB, in
association with another transcription factor, regulates the
activation and proliferation of macrophages [28], and that
the SMAD signaling pathway plays a role in
hematopoiesis and translational hematology [29]. A
binding site for GATA was observed only in humans and
mice. This transcription factor regulates the
cell-lineage-specific expression of lymphocyte homing
receptors [30]. These data strongly suggest that the
FUT7 promoter region became more complex and
could be related to the evolution of the immune system in
vertebrates and to the phylogenetic tree of Fuc-TVII [14].
Surprisingly, no significant binding sites for the last
cluster have been found in horses and cattle,
suggesting another type of regulation of the fut7 gene in
lymph nodes for these species.

Amino acid sequence alignment of vertebrate
Fuc-TVII shows a strong identity within the five conserved
motifs previously described [31,32]. However, this identity
is lower than for Fuc-TIV and Fuc-TIX, two enzymes
implicated in Lewis X motif synthesis, since 98% homology
was observed between vertebrate species
for Fuc-TIX [23]. For Fuc-TVII, the largest differences are
observed near motifs I and IV, respectively involved in
substrate binding and enzyme activity. Indeed, the
number of proline residues is especially high in
mammals (Figure 6) and may be correlated with a change in
acceptor substrate specificity. It would be interesting
to further investigate the role of these numerous
prolines in Xenopus tropicalis motif I and Anolis
carolinensis motif IV (Figure 6).
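The proline content discussed above can be tallied directly from the aligned segments. The sketch below counts prolines in the TMD/motif I region for four of the species shown in Figure 6; the sequences are transcribed from that figure with layout spacing removed, and the dictionary and function names are hypothetical.

```python
# TMD / motif I segments transcribed from Figure 6 ("-" marks an alignment gap).
MOTIF_I_REGION = {
    "H. sapiens": "WLLWLLGSAPRGTPAPQPTITILVWHWPFTDQPPELPSDTCT-R",
    "B. taurus": "WLWWRLGAAPGGAPAPQPTITILVWHWPFASQPPELPGDTCT-R",
    "G. gallus": "WNLKCFIRSSNSSDEALKHTKVLTVLVWEWPS-RQVPDLSGDVCREQ",
    "D. rerio": "LPLIGEQHLN-SSNASFTNNITILLWHWPFGMRYN-LTGDVCLRD",
}


def count_prolines(sequence):
    """Count proline residues (one-letter code P), ignoring gap characters."""
    return sequence.replace("-", "").count("P")


if __name__ == "__main__":
    for species, segment in MOTIF_I_REGION.items():
        print(f"{species}: {count_prolines(segment)} prolines")
```

In this region the two mammalian segments carry several times more prolines than the chicken and zebrafish segments, in line with the observation made in the text.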

In the region implicated in the catalytic mechanism 
after motif V, we identified a new motif (motif VI) con- 
taining 16 amino acids. A short and well conserved se- 
quence HVD/ED was observed at the end of this motif. 
Glutamic acid residue was found in Fuc-TIX, an enzyme 
that synthesizes mainly Lewis X and Lewis Y (Table 1) 
whereas an aspartic acid residue was found in the other 
enzymes that synthesize sialyl-Lewis. bFuc-TVII pro- 
duces only sialyl-Lewis X and, like its human and mouse 
counterparts, preferentially recognizes Sialyl-LacNAc as 
a substrate. We suggest that the presence of aspartic 
acid in this short sequence leads to preferential recogni- 
tion of sialylated acceptor substrates. 

Fuc-TIV and Fuc-TIX are implicated very early in
development [33,34] and until now only a very small
number of mutations have been described for FUT9 in
databases. This points to its crucial function. In contrast, in
humans, a large number of mutations have been
described for FUT7 with a frequency around 1%. Among
them, one induces the loss of enzyme activity associated
with the appearance of diseases such as arthritis [16]. In
cattle, we identified seven polymorphisms after
sequencing the bfut7 coding sequence in 30 unrelated animals.
Three were missense mutations. Surprisingly, no
mutations were found in a dairy breed (Holstein), whereas
they were detected in beef breeds (Blonde d'Aquitaine
and Limousine) and in dual-purpose breeds
(Montbeliarde and Normande). More intense selection may have
further reduced levels of genetic variation in Holsteins
[35] and could explain the absence of polymorphism in
this breed. Compared to humans and mice, a high
polymorphism rate between animals from the same breed
without notable changes in phenotype could be related
TMD / Motif I

H. sapiens       WLLWLLGSAPRGTPAPQPTITILVWHWPFTDQPPELPSDTCT-R
M. musculus      WFFWLWGSAPGSAPVPQSTLTILIWHWPFTNRPPELPGDTCT-R
B. taurus        WLWWRLGAAPGGAPAPQPTITILVWHWPFASQPPELPGDTCT-R
C. familiaris    WLLWLLGSPPTGGPAPPHTLIILIWHWPFSDRPPELPSNTCI-S
M. domestica     WLLIAHFLSSSRPWDTSGVLQPHSKLTVLIWHWPFNKSLP--LDGDVCA--R
C. bellii        ALRHSEPLTVLIWQWPF--RHTPNLSTDVCADR
A. carolinensis  SLALASGDPLTVLVWEWPDG--QTPDLSADVCSRL
T. guttata       WTLRFFLNPSKGSGEDSKHREPLVILVWEWPS--KQVPNVSRDVCHEL
G. gallus        WNLKCFIRSSNSSDEALKHTKVLTVLVWEWPS-RQVPDLSGDVCREQ
X. tropicalis    LFWNSSFVLISPPGVAQGPPDPLTILIWHWPF--HRPMDLSGNVCLDR
S. salar         YGWGPYSDRSHIITGNLTILLWHWPFGPSYS-LEGDVCWDK
D. rerio         LPLIGEQHLN-SSNASFTNNITILLWHWPFGMRYN-LTGDVCLRD
O. mordax        HISKQGWLQVEPLYTNQDTIGNGSFLTILVWYWPFGYRYP--LEGDVCKDQ

Motif IV

H. sapiens       FNWVLSYRRDSDIFVPYGRLEPHWGPSP- -PLPAKSR-VAAW
M. musculus      FNWVLSYRRDSDIFVPYGRLEPLSGPTS- -PLPAKSR--MAAW
B. taurus        FNWVLSYRRDSDIFVPYGRLEPREGPAP- -PLPAKRG-LAAW
C. familiaris    FNWVLSYRRDSDIFVPYGQLEPHQGPAP- -PLPAKKR--VAAW
M. domestica     FTWVMSYRRDSDIFVPYGELVPRAGPDL- -PVPAKNR-TAAW
A. carolinensis  WTWVMSYRRDADIFMPYGKLVPRPPGDGEWIPPKSG-LWW
C. bellii        FNWVMTYRQDSDIFMPYGELVRHSASVDIPTKTG-LVSW
T. guttata       FNWVMTYRQDSDIFIPYGKLVPNRSATVNIPAKTN--LVSW
G. gallus        FNWVMTYRRDSDIFMPYGKLVPNRSATVSIPTKTN-LVSW
X. tropicalis    FNWTLyyUTUyUlirVPyGMLVPyPAKVPNLPNKTG-yvyW
S. salar         FNWTMSYRRDADFTIPYGKLVPKESDSGNYSSTEDVFPKNKSTLACW
D. rerio         FNLTMSYHPGADITVPYGKLLLR-TNPGLDVIVPKSKRYEACW
O. mordax        FNWTMTYRRDADIPIPYGELIRKQAVNSSSDSIEDSITQNKSKLVCW

Figure 6 Alignment of the region surrounding the FucT-VII motifs I and IV in mammals, birds, lizards, amphibia and fish. Prolines are in
red characters and the positions of the transmembrane domain (TMD) and Motif I are indicated. Mammals are boxed in blue and include humans (Homo
sapiens; AAH86312), mice (Mus musculus; NM_013524), cows (Bos taurus), dogs (Canis familiaris; NP_001005379) and opossums (Monodelphis
domestica; XP_001374334). The Sauropsida are boxed in orange and include Carolina lizards (Anolis carolinensis; WGS AAWZ02039323.1
Cont2.39322), painted turtles (Chrysemys picta bellii; WGS AHGY01431622.1 Contig440.209), chickens (Gallus gallus; BAB82491) and zebra finches
(Taeniopygia guttata; XP_002192597). The last four sequences correspond to the western clawed frog (Xenopus tropicalis; AY692026), Atlantic
salmon (Salmo salar; NP_001133942.1), zebrafish (Danio rerio; AY788991), and rainbow smelt (Osmerus mordax; ACO08841).



to the greater number of tissues where bfut7 is expressed. 
In bovine, the encoded enzyme could act on a richer pro- 
tein repertoire. This specificity could have consequences 
for modulation of the immune response, known to be 
complex in bovines, required for control of the substan- 
tial load of microorganisms present in the rumen and 
close contact in the herd that promotes rapid disease 
transmission [36]. 

Conclusions 

We can conclude that the bfut7 gene encodes an enzyme
that mainly ensures the synthesis of the sialyl-Lewis X
motif. Regulation of its expression seems to have become
increasingly specific and complex from fishes to mammals.
Indeed, in mammals, expression mainly depends on
transcription factors associated with the immune system
response. Whereas its implication in leukocyte
recruitment is well documented, the relatively large amount of
polymorphism found in only a few animals is consistent
with a gene extending its abilities in terms of tissue
expression and target proteins in cattle.

Methods 

Materials 

Oligonucleotides were synthesized by MWG (Germany).
The 2X pre-aliquoted PCR master mix was from
Abgene. GDP-[14C]fucose (310 mCi/mmol) was from
Amersham Pharmacia Biotech (USA). Dulbecco's modified
Eagle's medium (DMEM) with 4.5 g/L glucose and fetal
calf serum were from Eurobio (France). L-Glutamine,
nonessential amino acids, antibiotics and trypsin-EDTA
1X were from Invitrogen Life Technologies (France).
Sialyl-type I and II oligosaccharides were provided by
Lectinity (Russia).

Molecular cloning of bfut7 cDNA 

A WGS contig (AAFC01684977) was identified by
BLAST analysis of the bovine genome sequence
databases with human FUT7 (U11282) and mouse fut7
(AAC52484) mRNA sequences. High similarities were
found with the last 410 bp of the coding region and
oligonucleotide primers fut7f1 and fut7r1 were designed
(Table 3). Moreover, a primer covering a region highly
conserved in human and mouse was also
designed (fut7f2). For amplification of the bovine fut7
sequence, total RNA was extracted from 250 µg of bovine
tissue with the Qiagen RNeasy Midi kit. Reverse
transcription was then performed with 10 µg of total RNA
using the High Capacity cDNA Archive Kit (Applied
Biosystems, Foster City, CA) with random primers. From the
obtained cDNA, PCR amplifications were done using
primers fut7f1 or fut7f2 and fut7r1 (Table 3). The 25 µL
PCR mixture contained 1 unit of DyNAzyme EXT,
0.4 µM of each primer, 0.2 mM of dNTP, 0.2 µg of
cDNA and 10% DMSO (v/v). Reactions were run under
the following conditions: a first denaturation step of
7 min at 94°C followed by 40 cycles, each
including 1 min at 95°C, 1 min at 62°C and 2 min at 72°C,
and a final extension of 7 min at 72°C. The whole coding
region was obtained after 5'-RACE analysis, which
was run following the instructions of the 5'/3' RACE Kit,
2nd generation (Roche, Mannheim, Germany). For that,
primer fut7r3 was used for the total RNA reverse
transcription and primer pairs Oligo d(T)-Anchor primer/fut7r4
and PCR Anchor primer/fut7r5 (Table 3) were used for
the first and nested PCR, respectively. All PCRs were
performed with 8% DMSO (v/v). PCR programs were
conducted at an annealing temperature of 60°C. All
PCR products were subcloned into the Topo TA
Table 3 Oligonucleotide primers used in this paper

Name                      Primer sequence
fut7f1                    5'-CCTGTGCGCCAGCTGCOTCTG-3'
fut7f2                    5'-CACCATCC1TGTCTGGCACTGG-3'
fut7f3 (a)                5'-GAGAATTCAGGAATGCAGAATGCTGG-3'
fut7f4                    5'-GCCGTGAATGCAGAATGCTGG-3'
fut7r1                    5'-GCAGCGCTCAGGCCTGGAACC-3'
fut7r2                    5'-CAGAAGGCAGCTGGCGCACAGG-3'
fut7r3                    5'-GCTCAGCCAGGGGCGGGCG-3'
fut7r4                    5'-GGCGTCGGCGCCGGCCAGCAGG-3'
fut7r5                    5'-GGCTGGCTGGCAAATGGCCAGTGCC-3'
fut7r6 (b)                5'-CTGGTACCCACAGCTCAGGCCTGGA-3'
Oligo d(T)-Anchor primer  5'-GACCACGCGTATCGATGTCGACTTTTTTTTTTTTTTTTV-3'
PCR Anchor primer         5'-GACCACGCGTATCGATGTCGAC-3'
GAPDHf                    5'-GTGAAGCAGCGCTCAGAGG-3'
GAPDHr                    5'-TOAAGTCGCAGGAGACAACC-3'

*Restriction sites: (a) EcoRI and (b) KpnI.




sequencing vector (Invitrogen, Carlsbad,CA). Sequen- 
cing reactions were run with the Big Dye Terminator 
vl.l kit (Applied Biosystem) on a 3100 genetic analyzer 
(Applied Biosystem). Sequence comparisons were done 
using Align Sequence Nucleotide BLAST on the NCBI 
site using the bovine fut7 sequence DQ339142.1 as a 
reference. 

Tissue expression, organization and chromosome 
localization of bfut7 

Tissues were collected from Limousine cattle at the
slaughterhouse. The presence of fut7 transcripts in
bovine tissues was detected by PCR using cDNA, prepared
as described in Molecular cloning of the bfut7 cDNA,
under the following conditions: 12.5 µL pre-aliquoted
2X PCR master mix (Abgene, Courtaboeuf, France),
0.3 µM primers (fut7f4/fut7r5; Table 3), 0.2 µg cDNA
and 10% DMSO (v/v) in a final reaction volume of 25 µL.
The PCR program included the following steps: 3 min of
denaturation at 94°C, then 45 cycles of 1 min at 94°C,
30 s at 60°C and 45 s at 72°C, followed by a final
extension step of 3 min at 72°C. For GAPDH, primer pair
GAPDHf/GAPDHr (Table 3) was used at 0.3 µM in a
25 µL reaction including 12.5 µL pre-aliquoted 2X
PCR master mix (Abgene) and 0.2 µg cDNA. The cycling
conditions included an initial incubation at 94°C
for 3 min, followed by 37 cycles of 30 s at 94°C, 30 s at
56°C and 30 s at 72°C, and then a final extension step at
72°C for 5 min. Touchdown PCR conditions were used:
the annealing temperature was decreased from 62°C to
56°C in steps of 1°C per cycle over the first seven
cycles, and was then held constant at 56°C for the
remaining cycles.
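
The touchdown step described above can be written out as a per-cycle annealing schedule. The following is a minimal sketch only: the function name and the choice of 37 total cycles in the usage line are illustrative assumptions, not part of the original protocol description.

```python
def touchdown_schedule(start_c, end_c, step_c, total_cycles):
    """Return the annealing temperature (in degrees C) for each PCR cycle.

    The temperature starts at start_c, drops by step_c per cycle until it
    reaches end_c, and is then held at end_c for the remaining cycles.
    """
    temps = []
    current = start_c
    for _ in range(total_cycles):
        temps.append(current)
        # Lower the temperature for the next cycle, never going below end_c.
        current = max(end_c, current - step_c)
    return temps


# Illustrative call: 62 -> 56 degrees C in 1 degree steps; the total of
# 37 cycles is an assumption made for this example.
schedule = touchdown_schedule(62.0, 56.0, 1.0, 37)
print(schedule[:8])
```

With these values the seventh cycle is the first one run at 56°C, which agrees with a decrease spread over the first seven cycles.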

The same PCR reactions, using primer pair fut7f4/fut7r5
(Table 3), were performed on 0.1 µg of genomic DNA to
determine the gene structure, and were used to screen the
whole-genome radiation hybrid panel described in [17] to
determine the chromosomal localization of bfut7.

Polymorphism analysis 

Polymorphism was examined in 30 animals from 6 different
cattle breeds (Blonde d'Aquitaine, Charolaise,
Holstein, Limousine, Montbéliarde and Normande), with five
animals per breed. Blood from these animals
was collected at the slaughterhouse. Genomic DNA was
extracted from blood cells using the QIAGEN DNeasy
Blood and Tissue Kit. Then, fut7 was amplified as 3
fragments using primer pairs fut7f4/fut7r5, fut/flfut/r^
and fut7f2/fut7r1 under the PCR conditions described in
Molecular cloning of the bfut7 cDNA. The PCR products
were then sequenced with each primer using the BigDye
Terminator v1.1 kit (Applied Biosystems) on a 3100
Genetic Analyzer (Applied Biosystems).

Expression of recombinant bFucT-VII and
fucosyltransferase assays

A DNA fragment including the complete coding sequence
was amplified by PCR from thymus cDNA using
primers fut7f3 and fut7r6 under the same PCR conditions
as described in Molecular cloning of the bfut7 cDNA. The
fragment was then cloned into the pFLAG-CMV-2 mammalian
expression vector (Sigma) between the EcoRI and KpnI
restriction sites. COS-1 cells were transfected with 6 µg
of native or recombinant expression vector using FuGENE 6
Transfection Reagent (Roche). After 48 h, cells were
trypsinized and proteins were extracted from pelleted
cells with a lysis buffer containing 10 mM sodium
cacodylate, pH 6, 20% (v/v) glycerol, 1 mM DTT and 1% (v/v)
Triton X-100.

Enzyme assays were performed in 50 µL containing
25 mM sodium cacodylate pH 6.5, 5 mM ATP, 20 mM
MnCl2, 10 mM α-L-fucose, 3 µM GDP-[14C]-fucose
(310 mCi/mmol) and 0.1 mM acceptor substrate. After 2 h
of incubation, reactions were stopped by adding 1 ml of
water and applied to a conditioned Sep-Pak C18
reverse-phase chromatography cartridge (Millipore Corp.,
MA). After a 15 ml H2O wash, products were eluted
directly with 10 ml of methanol into scintillation vials
and counted with one volume of Instagel (Packard, IL) in
a liquid scintillation beta counter (TRI-CARB 2100 TR, IL).
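
For orientation, the donor concentration and specific activity given above can be converted into the amount of labelled fucose per assay and into a factor for translating scintillation counts into picomoles of transferred fucose. This is a sketch under stated assumptions: counts are taken as dpm (that is, 100% counting efficiency), the specific activity is assumed undiluted by unlabelled GDP-fucose, and the function names and the sample and blank values in the usage line are hypothetical.

```python
DPM_PER_CI = 2.22e12  # disintegrations per minute in one curie


def donor_pmol(conc_um, volume_ul):
    """Amount of donor substrate in the assay; µM x µL gives pmol."""
    return conc_um * volume_ul


def dpm_per_pmol(specific_activity_mci_per_mmol):
    """Convert a specific activity in mCi/mmol into dpm per pmol.

    mCi/mmol is numerically equal to Ci/mol, and 1 pmol is 1e-12 mol.
    """
    return specific_activity_mci_per_mmol * DPM_PER_CI * 1e-12


def fucose_transferred_pmol(sample_dpm, blank_dpm, specific_activity_mci_per_mmol):
    """Picomoles of [14C]-fucose incorporated, after blank subtraction."""
    net_dpm = sample_dpm - blank_dpm
    return net_dpm / dpm_per_pmol(specific_activity_mci_per_mmol)


# Values from the assay description: 3 µM donor in 50 µL, 310 mCi/mmol.
total_pmol = donor_pmol(3.0, 50.0)
factor = dpm_per_pmol(310.0)
print(total_pmol, factor, total_pmol * factor)

# Hypothetical readings, for illustration only.
print(fucose_transferred_pmol(7000.0, 118.0, 310.0))
```

Under these assumptions each assay contains 150 pmol of GDP-[14C]-fucose, corresponding to roughly 1 x 10^5 dpm in total.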

Bioinformatics 

To identify putative promoter sequences within a
1000 bp region upstream of the start codon, genomatics
software tools were used (www.genomatics.org),
applying a threshold of 94% similarity. Sequences were
extracted from NCBI databases (Equus caballus:
NW_001867391.1; Homo sapiens: NW_001839245.1;
Mus musculus: NW_001030686.1; Didelphis domestica:
traces gnl|ti|474330053 and gnl|ti|526972938; Bos taurus:
gi|269932530:97318-108050; Xenopus tropicalis: Ensembl
scaffold:JGI_4.2:GL173522.1:48484:50721:-1; and Gallus
gallus: gi|118099516:288657-388656). The accession
numbers for Fuc-TIV, Fuc-TIX and Fuc-TIII in western
clawed frog, chicken, and human were the same as in the
previous paper on vertebrate FUT phylogeny [14].
Alignments of 14 amino acid sequences were obtained using
the program MUSCLE (Multiple Sequence Alignment),
available at http://www.ebi.ac.uk/Tools/msa/muscle/.
Of the 576 aligned sites, uninformative positions, mainly
located in the stem region of the molecule, were
removed, and 276 were retained.
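
The criterion used to discard positions is not detailed above. As an illustration of this kind of trimming, the sketch below assumes one simple rule, namely dropping every alignment column whose gap fraction exceeds a threshold. The rule, the function name and the toy alignment are all assumptions, and this is not presented as the procedure actually applied to the 576-site alignment.

```python
def filter_columns(alignment, max_gap_fraction=0.0):
    """Keep alignment columns whose gap fraction is <= max_gap_fraction.

    alignment is a list of equal-length aligned sequences using '-' for gaps.
    Returns the trimmed sequences and the indices of the retained columns.
    """
    if not alignment:
        return [], []
    length = len(alignment[0])
    if any(len(seq) != length for seq in alignment):
        raise ValueError("all aligned sequences must have the same length")

    n_seqs = len(alignment)
    kept = []
    for i in range(length):
        gaps = sum(1 for seq in alignment if seq[i] == "-")
        if gaps / n_seqs <= max_gap_fraction:
            kept.append(i)

    trimmed = ["".join(seq[i] for i in kept) for seq in alignment]
    return trimmed, kept


# Toy alignment (hypothetical), three sequences of five columns.
toy = ["MK-LV", "MR-LV", "M-ALV"]
trimmed, kept = filter_columns(toy)
print(trimmed, kept)
```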

Phylogenetic analysis was conducted with the Maximum
Likelihood method implemented in MEGA5.0 [37]. The
best-fitting amino acid substitution model [38] was
selected on the basis of the BIC (Bayesian Information
Criterion). The robustness of branches was tested with
1000 bootstrap replicates.
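
Model selection by BIC can be summarised as computing BIC = -2 ln L + k ln(n) for each candidate model and keeping the model with the lowest score. The sketch below shows that calculation only. The log-likelihoods and parameter counts are hypothetical placeholders rather than values from this study, and taking n as the 276 retained alignment sites is an assumption about how the sample size is defined.

```python
import math


def bic(log_likelihood, n_params, n_sites):
    """Bayesian Information Criterion: -2 ln L + k ln(n)."""
    return -2.0 * log_likelihood + n_params * math.log(n_sites)


def best_model(candidates, n_sites):
    """Return the name of the candidate with the lowest BIC.

    candidates maps a model name to (log_likelihood, n_params).
    """
    scores = {
        name: bic(lnl, k, n_sites) for name, (lnl, k) in candidates.items()
    }
    return min(scores, key=scores.get), scores


# Hypothetical fits, for illustration only. With 14 sequences an unrooted
# tree has 2 * 14 - 3 = 25 branch lengths; a gamma shape parameter adds one.
candidates = {
    "JTT": (-8450.0, 25),
    "WAG": (-8420.0, 25),
    "WAG+G": (-8390.0, 26),
}
name, scores = best_model(candidates, n_sites=276)
print(name)
for model, score in sorted(scores.items(), key=lambda item: item[1]):
    print(model, round(score, 2))
```

The penalty term k ln(n) means that an extra parameter is accepted only when it improves the log-likelihood by more than ln(n)/2, about 2.8 units for 276 sites.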